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Abstract 

Most physicists agree that the Einstein-Podolsky-Rosen-Bell paradox exemplifies much of the 
strange behavior of quantum mechanics, but argument persists about what assumptions underlie 
the paradox. To clarify what the debate is about, we employ a simple and well-known thought 
experiment involving two correlated photons to help us focus on the logical assumptions needed 
to construct the EPR and Bell arguments. The view presented in this paper is that the minimal 
assumptions behind Bell's inequality are locality and counterfactual definiteness, but not scientific 
realism, determinism, or hidden variables, as is often suggested. We further examine the resulting 
constraints on physical theory with an illustration from the many- worlds interpretation of quantum 
mechanics - an interpretation that we argue is deterministic, local, and realist, but that nonetheless 
violates the Bell inequality. 
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I. INTRODUCTION 



"Like all authors of noncommissioned reviews, [this author] thinks that he can restate 
the position with such clarity and simplicity that all previous discussions will be eclipsed. " - 
J. S. BelU 

In 1935, Einstein, Podolsky, and Rosen [EPR] argued that quantum mechanics cannot be 
a complete theory of the physical world. 2 They based their argument on an analysis of two 
separate but correlated particles, for which a characteristic of one particle can be determined 
by measuring the other particle.- Assuming the measurement of the second particle does not 
affect the first particle (no nonlocal effects), EPR claimed that because the characteristic 
of the first particle can be precisely determined without directly measuring it (that is, by 
measuring only the correlated particle), that characteristic must have a well-defined existence 
independent of the observer. In their words, it must be an "element of reality" and therefore 
ought to be described by any theory that claims to be complete. Because quantum mechanics 
does not in general describe the precise values of physical characteristics — it only makes 
statistical statements describing the likelihood of different values — quantum theory must 
therefore be incomplete. A properly complete theory, they implied, must contain extra 
"hidden variables" that would describe the particle characteristic precisely. 

In constructing this argument, EPR adhered to the common sense principle of "scientific 
realism" (that measurable characteristics of a physical system exist and are well defined, 
independent of any outside influence or observation). The "elements of reality" represent 
their view of that underlying realism. Furthermore, assuming the "elements" evolve in a 
predictable fashion, the principle of determinism (that complete knowledge of the current 
state of a physical system can be used to determine the future state of the system) is also 
valid. In this way, the EPR "elements" allow precise predictions of measurement results and 
offer a potential alternative to the statistical nature of orthodox quantum mechanics. For 
decades following the EPR publication debate raged over the validity of these principles and 
what kind of predictions should be expected from a physical theory, without providing any 
clear alternative to quantum mechanics. 

In 1964, John Bell published a related thought experiment^ that distinguished between 
quantum mechanics and all local hidden-variable theories. In John Bell's version of the 
EPR thought experiment, quantum mechanics and local hidden- variable theories predicted 
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statistically different experimental results. What had previously been a philosophical debate 
suddenly had testable consequences. Bell's experiment has since been performed in the 
laboratory, 5 verifying the predictions of quantum mechanics and dramatically contradicting 
Bell's predictions for local hidden-variable theories.- 

Since then, debate has focused on the foundation of Bell's predictions. Bell's original 
proof concentrates on theories with hidden variables (that is, variables that "determine pre- 
cisely the results of individual measurements,"-) which are inherently realist.- Some authors 
state that the derivation applies specifically to deterministic hidden variables.- 8 Other au- 
thors suggest that Bell's predictions rest more generally on local realism^ (without being 
necessarily deterministic) or on locality alone™' Broadly speaking, there are two camps of 
thought on this issue: there are those who think that Bell's inequality can be derived purely 
from the assumption of locality (and the experimentally measured correlations in the EPR 
experiment), and those who think that Bell's theorem requires some other assumption as 
well. 

In this paper we use a carefully broad definition of "locality" (see the following) and 
support the latter view. However, we argue that realism, determinism, and hidden variables 
do not provide the minimal necessary additional assumption, and that these assumptions 
have only incidental connection with the logic of the Bell derivation. Instead, following 
the comments of previous authors,— we claim that the minimal assumptions behind Bell's 
predictions are locality and "counterfactual definiteness" (that we can postulate a single 
definite result from a measurement even when the measurement is not performed), and we 
illustrate the subtle distinctions between these various assumptions with some examples. 

There is some risk in trying to clarify such a contentious issue. Although the role of coun- 
terfactual definiteness in the Bell analysis was identified as early as 1971,— it is not broadly 
accepted by the physics community, and there is much debate about the interpretation of 
the Bell experiment. In most cases, the abstract nature of the argument inhibits explana- 
tion to lay audiences and even confounds the discussion for professional physicists. To try 
to alleviate this confusion we focus on a simple concrete example to illustrate the logic in 
Bell's derivation and clarify the assumptions about locality, hidden variables, realism, deter- 
minism, and counterfactual definiteness. Hopefully, this approach can be used as the basis 
for explanation to a lay audience and to help clarify the concepts for professional scientists 
(as it has for this author). We conclude by briefly describing some popular interpretations 
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of quantum mechanics (focusing especially on the many-worlds interpretation) as a way of 
further illustrating the various assumptions about an underlying physical reality. 



II. THE EPR THOUGHT EXPERIMENT 

In their paper, 2 EPR explored the predictions for position and momentum measurements 
of correlated two-particle systems. In 1950, David Bohm reformulated the EPR experiment 
as measurements of spin 1/2 particles-^ instead of the continuous range of results from 
position or momentum measurements. In our approach we consider measurements of photon 
polarizations, which follow the approach of many real-life EPR experiments and are closely 
analogous to Bohm's thought experiment. Photon polarizations are easier to measure than 
the orientation of spin 1/2 particles — a polarizing filter 14 can be used instead of a Stern- 
Gerlach apparatus — and are easier to explain to the philosopher on the street because they 
don't require the concept of spin. The photons pass through or are absorbed by a polarizing 
filter, and the probability of a photon with a certain (linear) polarization passing through 
an ideal polarizing filter at a certain orientation can be calculated using simple geometry. 

We begin by imagining two photons that are traveling side-by-side in the same direction 
(but separated from each other), and which are prepared in the "twin state" polarization. 
In quantum mechanics the term "twin state" has a particular meaning, but our analysis of 
Bell's logic requires only an experimental definition. In practical terms, the "twin state" 
means that measurements of the two photons using identical polarizing filters always produce 
the same result. If the first photon passes through a polarizing filter, the second photon is 
guaranteed to pass through another filter oriented at the same angle. If the first photon is 
absorbed, the second photon will also be absorbed, no matter the orientation of the filters, 
as long as the two filters are pointing in the same direction. The results of the polarization 
measurements are perfectly correlated.— 

In quantum mechanics the twin state is represented by an entangled isotropic state such 

as 



where \x)i represents a polarization state of the zth photon that is guaranteed to pass through 
a horizontal filter (and guaranteed to be absorbed by a vertical filter) and \y)i represents a 
polarization state that is guaranteed to pass through a vertical filter (and guaranteed to be 
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absorbed by a horizontal filter). In this form it is clear that if photon 1 registers as having 
vertical polarization, photon 2 also turns out to be vertical. In addition, we recognize that 
the twin state is rotationally symmetric by rotating the x and y axes by an arbitrary angle: 



^=^=(\x) 1 \x) 2 + \y) 1 \y) 2 ) (2a) 

= -L(i\ x y iC os6- Mi sin 0] [\xy 2 cos0-\ y y 2 sm0] 

+ [|y)' 1 cos0 + \x)[sm9} [\y)' 2 cos0 + \x)' 2 sin 0]) (2b) 

= ^(k> , ik> / 2 + ll/>ill/> , 2 ), (2c) 

where x' and y' are rotated counterclockwise from x and y by angle 0. The fact that the form 
is the same in the rotated basis indicates that the state is rotationally symmetric. Although 
this representation is not crucial to our study, it satisfies our experimental definition of the 
twin state: no matter what direction we orient our filters, the two polarization measurements 
are always the same. 

The original EPR argument applies to the twin state system as follows. Although initially 
we don't know the polarization of the first photon (it could either pass through or be 
absorbed by its filter), we can determine it by measuring the polarization of the second 
photon. Because by doing this measurement we determine the polarization of the first 
photon without disturbing it, EPR argue that the polarization of the first photon must 
therefore be a well-defined quantity, independent of its measurement. It follows that a 
complete theory should contain an element that specifies this polarization. 

In stark contrast to this view, the orthodox interpretation of quantum mechanics claims 
that the polarizations of the photons are not well defined prior to measurement. Initially 
the two photons are in an entangled superposition of \x) and \y) states. Only when the 
measurement of one of the photons takes place do both photons collapse to well-defined 
polarizations. Quantum mechanics does not precisely specify what the polarizations are for 
a particular pair of photons before they are measured. Quantum mechanics does not even 
admit the definite polarization of either photon as a valid concept in the theory. Although 
the quantum mechanical twin state of Eq. (1) specifies the correlation between photons - 
whatever the polarization of one photon, the other is guaranteed to be the same — it does 
not specify the individual polarization of either photon. Each photon is in an entangled 



superposition of ill-defined polarization. EPR argue therefore, following the principle of 
scientific realism, that quantum mechanical theory fails to specify an element of reality. As 
a physical theory, it is incomplete. 

As is commonly recognized, and as EPR themselves admitted, this argument assumes 
that the measurement of the second photon does not in any way affect the state of the 
(spatially separated) first photon. By this reasoning, EPR assume locality: namely that 
interactions between objects are point-like (local) interactions whose immediate effects are 
confined to a single location, and that actions at one location do not immediately have 
any effect at a separate location. This assumption, when combined with the conclusions of 
special relativity, implies that no effect can travel faster than the speed of light in vacuum.— 

This notion of locality is closely related to (but not exactly the same as) relativistic 
causality, which offers a very powerful logical constraint on theory. Relativistic causality 
states that no thing (neither object nor signal) can travel faster than the velocity of light 
in vacuum. (However, it does not rule out the possibility of some kind of information- 
free superluminal effect, such as quantum collapse.) If objects or information could be 
transmitted faster than the speed of light — if for instance, a nonlocal effect could somehow 
be used to instantaneously transmit a signal — special relativity implies^ that causality 
would be violated, and we would be in danger of a number of logical paradoxes, such as 
accidentally causing the death of our grandmother before we were born. 

Although EPR recognized the locality assumption in their argument, they believed that 
assumption was unassailable. Regarding the possibility that elements of reality could depend 
on non-local effects, they concluded: "No reasonable definition of reality could be expected 
to permit this. — 

III. BELL'S INEQUALITY 

EPR did not dispute the accuracy of the statistical predictions of quantum mechanics. 
They argued that quantum mechanics should describe more than it does; it should precisely 
describe the outcomes of individual measurements. Decades later, John Bell focused on the 
statistical predictions themselves and asked how quantum mechanics could be experimentally 
distinguished from other possible theories of the world. 

Bell's innovation was to consider polarization measurements for twin state photons that 



6 



are not necessarily made at the same angle.— Although such pairs of measurements are not 
perfectly correlated, we should be able to calculate the average statistical correlation. The 
astonishing result is that the correlation predicted by quantum mechanics is not the same 
as the correlation predicted by common sense. A difference exists at the statistical level, 
which can be unambiguously resolved by experiment. 

To follow this line of thought, imagine making measurements of twin state photons with 
polarizing filters oriented at two different angles, 8\ for the first photon and 82 for the second 
photon. What does quantum mechanics predict for the probability of getting the same 
measurement in both cases (either both photons transmitted or both absorbed)? According 
to quantum mechanics, whichever measurement is performed first^ collapses the entangled 
twin state superposition to a single polarization state that is identical for both photons. 
Thus, if photon 2 passes through a filter oriented at angle 82, both photons are collapsed to 
a polarization state along that direction. If we wish to know what the probability is of getting 
the same measurement for photon 1, we need only figure out what the probability is for a 
photon with polarization along 82 to pass through a filter oriented along 8±. This probability 
is easily calculated according to simple trigonometry. Any arbitrary linear polarization can 
be thought of as a superposition of polarization along the 8\ direction (which will pass 
through the filter) and perpendicular to the 8\ direction (which will be absorbed by the 
filter). For a wave polarized along the 8 2 direction, the amplitude component along the 8\ 
direction is given by cos(#2 — #1), and the probability for transmission, given by the wave 
amplitude squared, is cos 2 (8 2 — 81). That is the prediction of quantum mechanics. 

Now let us contrast this prediction with the prediction of common sense. This is where 
Bell comes in, and we apply his logic to our two-photon case.— For measurements with 
filters along the same direction, we know that the probability of obtaining identical mea- 
surement results (both transmitted or both absorbed) is 100%. A typical sequence of paired 
measurements, represented as either absorbed (A) or transmitted (T), might look like the 
following: 
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We always see both photons absorbed or both transmitted, never one absorbed and the other 
transmitted. Similarly, we know that paired measurements with filters oriented perpendic- 
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ular to each other will always give opposite results; whenever one photon passes through its 
filter, the other photon is absorbed. 

Furthermore, it is reasonable to assume that filter measurements differing by intermediate 
angles (between 0° and 90°) will yield intermediate correlations, and that a continuous range 
of angles gives rise to a continuous range of correlation. For example, imagine some angle 
between 0° and 90°, call it a, for which the mismatch between measurements is 25% on 
average (the coincidence rate is 75%). Thus, for filter 1 oriented at 9± — and filter 2 
oriented at angle 9 2 = a, typical sequences might be 
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where boldface entries highlight where there is disagreement between the two sequences. 
On average, we expect 25% of the cases will disagree, and in our illustration, six out of 24 
measurements do so. The same average coincidence rate should be present if 9\ = —a and 
9 2 = (which corresponds to tilting one's head by angle a) so that typical sequences in this 
case might be: 
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Now consider how the sequences should compare if 6\ = —a and 9 2 = a: 
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where we have just copied the previous sequences of measurements from 9\ = —a and 9 2 = a 
to represent typical sequences of measurements (though we do not always measure exactly 
the same sequences, just sequences consistent with the same statistical coincidence rates). 
Because the measurements of the first photon at angle —a disagree with measurements at 
angle zero by 25%, and because the measurements of the second photon at angle a also 
disagree with measurements at angle zero by 25%, we would expect that measurements of 
photon 1 at —a and measurements of photon 2 at a would disagree with each other by an 
average of at most 50%. The disagreement would be a maximum of 50% because there are 
some cases where both measurements disagree with the result at angle zero (highlighted in 
boldface) and therefore agree with each other. Thus, according to this logic, the average 
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mismatch between measurements with 9\ = —a and 62 = a should be less than or equal to 
50%. This constraint is an example of Bell's inequality. 21 

The startling consequence of this logic is that it conflicts with quantum mechanics. The 
"common sense" prediction states that the mismatch rate should be at most 50%, while 
quantum mechanics predicts (and experiment confirms) that the mismatch rate in this case 
is 75%. If we recall that the coincidence rate predicted by quantum mechanics is cos 2 (#2 — #1), 
we recognize that the mismatch rate should be 25% when a = 62 — 9i = 30° (that is, 
cos 2 (30°) = 0.75). When 61 = —30° and 62 = 30°, quantum mechanics predicts that the 
coincidence rate is cos 2 (60°) = 0.25. That is, the mismatch rate is 75%, which is not less 
than the 50% predicted by the common sense argument.— 

Since Bell's publication, measurements that test Bell's inequality have been performed 
many times,- and the experimental results clearly agree with quantum mechanics, and violate 
the inequality. We are therefore forced to reexamine the steps that led to our "common 
sense" conclusions. 

IV. BELL'S ASSUMPTIONS 

Where did common sense go wrong? There are only a few possibilities in our example. We 
might wonder if there really exists an angle a for which the mismatch is 25%. However, its 
existence is experimentally confirmed and the answer agrees with the prediction of quantum 
mechanics (a = 30°). One might also question if the rotated case (#1 = —a and 6* 2 = 0) 
really gives the same 25% mismatch as the original case (#1 = and 6 2 = a), but this case is 
also experimentally verifiable. These two assumptions are beyond reproach; the vulnerability 
in the argument lies in the last step, where we argue there are two possibilities. 

To go from the Q\ = —a, 9 2 = case to the #1 = —a, 9 2 = a case, we must grab hold of 
the second polarizer and rotate it from angle to angle a. In so doing, we implicitly assume 
that this action does not affect what happens at filter 1, and that the sequence for filter 1 
is the same as it would have been without the rotation of filter 2. This assumption is one of 
locality- 
Orthodox quantum mechanics violates this assumption via the process of instantaneous, 
nonlocal collapse. When photon 2 passes through (or is absorbed by) its filter, the wave 
function collapses simultaneously for both photons. Photon 1 is suddenly forced to match 
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the polarization of photon 2, which messes up the coincidence rates. 

We can see how this change in the coincidence comes about by noting that after the 
collapse, the zero-degree measurements, which are crucial for comparison to the 9 1 = a and 
6 2 = —a measurements, no longer agree between photon 1 and photon 2. If we assume, for 
example, that photon 1 passes through its 30° filter and photon 2 passes through its —30° 
filter, the collapsed state is then 1 30°) 1 1 — 30°)2, which can be expressed in terms of 0° and 
90° states as: 

ip= |30°)i| -30°> 2 (3a) 
y |0°>i + ^190°)^ (^V>2 - i|90°) 2 j (3b) 

= §|0°> 1 |0°> 2 - I|90°) 1 |90 o ) 2 + ^|90°) 1 |0°) 2 - ^|0°) 1 |90°) 2 . (3c) 

In this state there is no longer perfect agreement between photons 1 and 2 in the 0°-90° 
basis. The third and fourth terms in Eq. (3c) show possibilities where one photon passes 
through a 0° filter and the other would not. 

Our previous derivation of Bell's inequality hinged on the assumptions that: 

1. #i = measurements of photon 1 are 75% coincident with 62 = —a measurements of 
photon 2; 

2. 8 2 = measurements of photon 2 are 75% coincident with 9\ = a measurements of 
photon 1; and 

3. #i = measurements of photon 1 are 100% coincident with 9 2 = measurements of 
photon 2. 

From these three assumptions, we concluded that 9\ — a measurements of photon 1 are at 
least 50% coincident with 6 2 = —a measurements of photon 2. Now the last assumption in 
that logic has been broken. The 6\ = 62 = coincidence between the two photons has been 
reduced. 

Most likely, this particular resolution of the EPR paradox would have set Einstein's 
magnificent hair on end. It violates locality, one of his most cherished principles. In deference 
to his coiffure, however, we should note that he was right in one sense: causality itself is still 
respected. There is still no way for an object to go faster than the speed of light, and no 
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way to send a faster-than-light signal. Causality and special relativity remain intact because 
wave function collapse is an uncontrollable process. We can measure the polarization of a 
photon after its collapse, but there is no way to control which of the possible polarizations 
will be chosen during the collapse. Alice, by measuring and collapsing one photon of an 
EPR entangled photon pair, cannot in any deliberate way affect the result that Bob will 
get when he measures the other photon. There is no way to use this feature to send a 
faster-than-light signal.— Sending information at faster-than-light speeds violates causality, 
but quantum correlations caused by quantum collapse do not. Mercifully, our grandmothers 
are safe. 

This line of thought has led some authors to assume the laws of physics are nonlocal, but 
this conclusion is hasty, for there is another subtle assumption hidden in the logic. In the 
last step of Bell's experiment, we assumed not only that measurements are purely local, but 
we also assumed a sequence still exists that agrees with both the 9\ = 30° and 9 2 = —30° 
sequences at the 75% level. It is this comparison with the (now hypothetical) 9\ = 9 2 = 
sequence that allows us to connect the 9\ = 30° sequence with the 9 2 = —30° sequence in 
Bell's logic. 

Presumably, if we had made the 9\ = 9 2 = measurements instead of the 9\ = 30°, 9 2 = 
—30° measurements, we would have found such a sequence. Moreover, had we been allowed 
to make the 9\ = , 9 2 = —30° and 9\ = 30°, 9 2 = measurements, we could have verified 
(assuming locality) that the 0° measurements agree with the 9\ = 30° and 9 2 = —30° 
measurements at the 75% level. However, in the absence of making those measurements, 
can we still assume such a sequence exists? 

This last question seems so trivial it is difficult to imagine negating it. Part of the question 
is just inductive reasoning. Can we assume that if we were to repeat the measurements we 
did earlier, we would obtain results that agree with our previous experience, namely that 
9 = and 9 = a are coincident 75% of the time on average? Because this assumption 
is fundamental to predictive science — experiments must be repeatable and they must be 
consistent with each other — we may safely assume it is acceptable. At least if it fails, there 
are more serious problems than just the resolution of the EPR experiment. 

The other more subtle part of the question is if we can still postulate getting a single 
specific well-defined sequence for the 9 = measurements even when we don't actually make 
the measurements. Certainly, hidden variable theory satisfies this condition. Hidden variable 
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theory implies that the polarizations of the photons are well defined (a realist assumption) 
and that they are accurately and precisely described by the extra hidden variables of the 
theory. In fact, if we knew the values of the hidden variables, not only could we imagine 
making the 9 = measurements and obtaining a particular sequence, we could calculate 
precisely what that sequence would be. But this scenario is overkill. It is not necessary to 
assume hidden variables, determinism, or even realism to derive Bell's inequality. 

Determinism is demonstrably unnecessary in our example because we assumed only that 
there is a zero degree sequence that agrees statistically with the previously measured co- 
incidence rate. No specific sequence for the hypothetical zero degree measurements was 
assumed, only that the sequence remains correlated with 9\ = a and 62 = —oc measure- 
ments at the 75% level. The actual sequence in a particular string of measurements could 
still be random. Any theory that produces random specific results (that still agree with the 
statistical averages) would satisfy the assumption. 25 

Neither is scientific realism required in the logic. It might be tempting to think that our 
polarization measurements reveal some underlying reality such as the "actual" polarization of 
the photons. However, our example does not make this assumption. There is no requirement 
for the existence of photons or any other underlying reality. Our example makes assumptions 
only about the statistical correlation of measurement sequences. We could repeat the entire 
argument without ever using the word "photon," making reference only to the behavior of 
the experimental apparatus. Thus, we might have observed that two sequences produced by 
identical filter settings are 100% correlated, and that rotating one filter by angle a produces 
a 25% mismatch with the sequence it would have measured before rotation. It follows that 
two such rotations would produce at most a 50% mismatch. No reference to any kind of 
underlying reality (in particular no realist assumption) need be used. 

The critical assumption in our example (aside from locality) is that we can still postulate 
a single definite sequence for the case where one or both filters are oriented at 0°, even 
when those measurements are not made. We need not obtain any particular sequence, 
and the sequence need not be knowable beforehand; we need only specify a sequence that is 
statistically consistent with previous 0° measurements. In other words, we postulate a single 
sequence for 0° orientations (which is identical for both photons) that is still coincident with 
sequences measured at +30° and —30° at the 75% level. 

This assumption, that we are allowed to postulate a single definite result from an individ- 
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ual measurement even when the measurement is not performed, is known as "counterfactual 
definiteness.' — The role of counterfactual definiteness in the Bell argument was first iden- 
tified by Henry Stapp,— although he claimed to circumvent it in other contexts.— To deny 
counterfactual definiteness means that we are not necessarily allowed to consider a definite 
result for an experiment that we have not performed. Stapp originally imagined that the 
way to violate counterfactual definiteness might be through a kind of "super determinism," ' 
where the action of the experimenter is already predetermined, and alternative measure- 
ments are not allowed (or even imaginable). In practice, this approach is no different from 
traditional determinism taken to its logical conclusion. However, this view of counterfactual 
definiteness is too limited. There is a way to violate it without invoking determinism. 

The key is to allow more than one possibility for the potential result of a measurement. 
Orthodox quantum mechanics embraces this notion of multiple possibilities whenever a 
quantum state is in a superposition. In the absence of measurement (and collapse), there 
is no single definite potential result. Instead, there are many potential results represented 
by many components of the superposition. Moreover, the principle of complementarity 
guarantees that this situation arises frequently. Complementarity describes how certain 
pairs of measurements, such as position and momentum measurements or orthogonal spin 
measurements, cannot be made simultaneously. Thus, a definite momentum measurement 
forces the physical system into an indefinite superposition of distinct position eigenstates. 
Likewise, the potential result for one polarization measurement becomes indefinite when 
making another (non-commuting) polarization measurement. In our example, the unrealized 
Q\ = 02 = polarization measurements were rendered indefinite (a superposition of four 
possibilities) by the act of the Q\ = 30° and 82 = —30° measurements. This violation of 
counterfactual definiteness, together with the violation of locality, gave us the collapsed state 
in Eq. (3), which demonstrated the reduced coincidence between photons. 

For orthodox quantum mechanics, both locality and counterfactual definiteness are vio- 
lated, leading naturally to a violation of the Bell inequality. However, it can be confusing to 
unravel the separate effects of each assumption as they pertain to the Bell experiment. It is 
possible to violate Bell's inequality using either nonlocality or counterfactual indefiniteness 
alone, and there are examples of each approach. To better understand the role of counter- 
factual indefiniteness, it is instructive to examine an interpretation of quantum mechanics 
that relies solely on counterfactual indefiniteness to violate the inequality. One of the most 



13 



popular of these is the "many worlds" interpretation. 

V. MANY WORLDS 

In 1957, Hugh Everett conceived of the "relative state" interpretation of quantum 
mechanics,- 8 which was later popularized by Bryce DeWitt22 as the "many-worlds" inter- 
pretation and further developed by others. The many-worlds interpretation differs from 
orthodox quantum mechanics by assuming that collapse never happens; superpositions per- 
sist forever. The observer, instead of collapsing the measured system to a single result, 
becomes entangled with the system and is herself described by a more inclusive, entangled 
wave function with multiple components (that is, "branches"). Nonetheless, to the observer 
in a particular branch, who is unaware of other states in other branches, it still looks as 
if each measurement produces a single result. In this way, the many- worlds interpreta- 
tion eliminates the nonlocal collapse of orthodox quantum mechanics, while retaining the 
counterfactual indefmitess of superpositions. 

Some features of the many-worlds interpretation are controversial. For our purposes, the 
most pertinent issue is how to incorporate the idea of probability as represented by the Born 
rule in orthodox quantum mechanics, which is relevant for our discussion of the statistical 
correlation between photon polarization measurements. Everett claimed to have answered 
this challenge by defining probability in terms of the sequences of measurement results seen 
by observers along the different branches. In Everett's view, each sequence of polarization 
measurements in each branch is assigned a weight to "make quantitative statements about 
the relative frequencies of the different possible results of observation - which are recorded 
in the memory — for a typical observer state."— According to Everett, this weight is repre- 
sented by the same wave function amplitude that occurs in orthodox quantum mechanics. 
Thus, although all sequences are present in the many-worlds interpretation, even the rare 
ones, they have the same relative weights as corresponding sequences in orthodox collapse. 

This approach to probability in the many-worlds interpretation has been deemed inad- 
equate by several authors, and some have tried to address the perceived inadequacies of 
Everett's approach.— For instance, Deutsch^ and Wallace^ have pursued decision-theoretic 
methods to deduce the many-worlds probabilities in terms of the experience of a "rational 
decision maker."— In contrast, Zurek— has developed an objective approach to the many- 
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worlds interpretation probability (that is, not subjective to an observer) based on interaction 
with the environment as a whole, and making use of certain symmetries of entangled states. 

It is not the intent of this paper to review the full debate over the many-worlds interpre- 
tation. Here we merely wish to note that consistency with observer experience is possible 
according to many authors, and that collapse-free interpretations such as the many-worlds 
interpretation can offer a useful perspective for exploring the logic behind Bell's argument. 
It is also worth noting that following Bell's original work,- there have been a number of 
Bell-like constraints on theory that do not involve inequalities, but instead make precise 
predictions for common sense theories.— If we were to pursue any of these experiments, we 
might claim the discussion of probability is not directly relevant because there is no reference 
to statistics in the argument. For pedagogical reasons we prefer to focus on the original Bell 
test and how the many- worlds interpretation might satisfy it, assuming the notion of statis- 
tical correlation makes sense in this context. We claim that the many-worlds interpretation 
passes the Bell test by violating counterfactual defmiteness, while still respecting locality. 

First of all, we argue that after eliminating the nonlocal collapse of orthodox quantum 
mechanics, the many-worlds interpretation can be formulated as a local theory. In particular, 
the correlated entangled states used in EPR-Bell experiments can be produced via purely 
local processes. For instance, two photons with entangled polarizations might be produced 
from the decay of a parent particle. In this case the entangled state is produced at one 
location, where the parent decays, and its immediate effects are limited to that one spacetime 
point. Thereafter, the photons may go their separate ways, and as they separate they carry 
the correlation to separate locations. It is the original correlation produced at a single 
location that guarantees measurements will always match in any experiment in any branch 
where observers compare notes. In this respect the spread of the correlation to distant 
locations is akin to the delivery of newspapers, where a common story is generated at a 
central location and disseminated all over the neighborhood. In the many-worlds context, 
however, different branches (which originally split at a common location) carry different 
editions of the newspaper. 

In the orthodox interpretation nonlocality appears only when we measure the state of 
one of the entangled photons. Then and only then, does the state of the other distant 
photon change in a nonlocal fashion as a result of collapse. In contrast, a measurement in 
the many-worlds interpretation still involves entanglement — the measurement apparatus 
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entangles with the object under study thereby branching into an entangled superposition - 
but it does not involve collapse, and it remains a purely local process. Measurement induces 
local branching based on the local measurement result, but it does not cause branching at 
any distant location. As an example of this thinking, the many-worlds explanation of Bell's 
experiment 36 argues that when measurements are made on a pair of photons with 9% = 30° 
for filter 1 and # 2 = —30° for filter 2, the result is a superposition of four terms: 

i> = ^(30°, ?)£ 2 (?, — 30°)|30°> 1 | - 30°> 2 + ^(120°, ?)£ 2 (?, 60°)|120 o } 1 |60°} 2 

+ £i(30°, ?)£ 2 (?, 60°)|30°)i|60°) 2 + ^(120°, ?)£ 2 (?, -30°)|120°)i| - 30°) 2 , (4) 

where i?i(30°, ?) indicates that experimenter 1 has identified photon 1 as being polarized at 
30° (it has passed through a 30° filter) but does not yet know the polarization of photon 2, 
and similarly for the other terms. At first glance it would seem that each experimenter 
has branched into four versions - one branching as a result of the local measurement of 
his/her own photon and another branching as a result of the distant measurement of the 
other photon. However, this view is mistaken, for only local branching has really occurred. 
To illustrate the local nature of that branching, Eq. (4) can be factored into two terms 
according to: 

tP = ^ 1 (30 o ,?)|30 o ) 1 ( J E 2 (?,-30°)| -30°> 2 + £ 2 (?,60 o )|60°> 2 ) 

+ ^(120°, ?)|120°)i (£ 2 (?, -30°) | - 30°> 2 + E 2 (?, 60°)|60°) 2 ) , (5) 

which indicates that experimenter 1 has split into only two distinct branches according 
to his local measurement of photon 1. He is not yet entangled (or branched) with the 
result from photon 2, and has no knowledge of it. Similarly, there are only two branches 
for experimenter 2, corresponding to her local measurement of photon 2. After the two 
experimenters communicate their results to each other, each experimenter is finally split 
into four distinct branches corresponding to the four two-photon states: 

ip = £i(30°,-30 o )£ 2 (30 o ,-30 o )|30°)i| - 30°) 2 + ^(120°, 60 o )E 2 (120°, 60°)|120°)i|60°) 2 
+ (30°, 60°)£ 2 (30°, 60°)|30°)i|60°) 2 + £i(120°, -30°)£ 2 (120°, -30°)|120°)i| - 30°) 2 , 

(6) 

but this final splitting occurs only following a chain of local communications at sublight 
speed. In this context we see that entanglement and branching in the many-worlds inter- 
pretation are local, point-like operations. 
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If we accept that locality is preserved^! in the many-worlds interpretation, how does the 
many-worlds interpretation violate Bell's inequality and satisfy the experimental test? We 
claim that violation of counterfactual defmiteness is the key. By keeping superposition and 
eliminating collapse, the many-worlds interpretation avoids postulating a single result from 
a measurement (whether or not the measurement is actually made). When a polarization 
measurement is made, both results (photon transmitted and photon absorbed) exist in 
separate branches. Postulating a single sequence for a series of polarizations (measured or 
not) is not valid, and counterfactual defmiteness is violated. 

By taking this view, the many-worlds interpretation violates counterfactual defmiteness 
while preserving locality and satisfying the Bell experiment. Perhaps the best way of seeing 
how this interpretation works in the Bell experiment is to look again at the algebra we used 
for conventional quantum mechanics. When making a measurement with filter 1 at 30° 
and filter 2 at —30°, four different results are possible (each photon is either transmitted 
or absorbed). The many- worlds interpretation posits that each of the four results persists 
as a separate component of the superposition. Taken together, the four two-photon states 
in the four branches comprise the twin state from whence they originated. However, when 
we consider a particular experimental result, we focus on a single branch, and within this 
branch the two photons are not in the twin state. For instance, for the branch in which 
photon 1 passes through a 30° filter and photon 2 passes through a —30° filter, the accessible 
two-photon state is the one in Eq. (3), just as it is for orthodox quantum mechanics. 

Within this branch, the two-photon state is in a superposition of four possibilities for 
Q\ = 6 2 = measurements, as the four terms of Eq. (3c) demonstrate. The last two of these 
terms are cases where the result of the 0\ = measurement differs from the result of the 
02 = measurement. Thus, in this branch there is no longer a single possibility for 0\ = and 
02 = measurements that agrees for both photons. In the absence of making the 0\ = 02 = 
measurements, we are not allowed to assume a single definite result for those measurements, 
contrary to the principle of counterfactual defmiteness. The alternative possibility where 
the experimenters make 0\ = 02 = measurements that always yield the same results for 
photon 1 and photon 2 is a counterfactual hypothesis, which is not valid in this branch. 

In the many-worlds interpretation, in the absence of collapse, there are four branches 
not one, but there is a loss of correlation in each of the four branches, just as there is for 
the single result of orthodox quantum mechanics. What is true for the branch described by 
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Eq. (3) is true for each of the other three branches. By choosing to look at the Q\ = 30° 
and 6*2 = —30° measurements instead of the 9\ = 62 = measurements, we have invalidated 
our previous assumption about the 100% correlation of the (now counterfactual) 6\ = and 
6 2 = measurements. From the point of view of the many-worlds interpretation, we can say 
that the 9 2 = sequence that would agree with the 61 = 30° sequence at the 75% level and 
the #i = sequence that would agree with the 62 = —30° sequence at the 75% level cannot 
occur in the same branch, and therefore should not be expected to agree with each other. 

VI. CONCLUSIONS 

We have argued that the minimal assumptions needed to derive the Bell inequality are 
locality and counterfactual definiteness. Because nature violates Bell's inequality, we are 
forced to admit that nature's laws cannot obey both of these assumptions. 

Furthermore, we should note, as did Bell,— that the perfect correlations present in EPR 
experiments imply that local theories must also be deterministic. This conclusion rests 
on the idea that a theory with only local interactions precludes construction of the EPR 
correlation at a distance, and therefore the correlation must originate at a common point 
and propagate outward. For the perfect correlation to propagate undisturbed to separate 
locations, the theory must also be deterministic. Thus, taking the results of EPR and Bell 
experiments together, we conclude either that valid physical theories are nonlocal, or they 
are local, deterministic, and counterfactually indefinite. 

Hidden- variable theories obey counterfactual definiteness. They say not only that specific 
results to hypothetical experiments can be assumed, but that the values of the hidden 
variables describe those specific results. In this way, hidden- variable theories define a single- 
valued realism. Therefore, the only way in which a hidden-variable theory can satisfy Bell's 
experiment (and violate the inequality) is through a nonlocal effect. This approach describes, 
for example, David Bohm's "ontological" interpretation of quantum mechanics, 39 which is a 
nonlocal hidden- variables theory. 

The EPR "elements of reality" can be considered a form of hidden variables, at least for 
the purposes of deriving Bell's inequality. EPR imagined that each "element" corresponds 
to a single specific trait of a physical system. In particular, they did not imagine multiple 
possibilities (that is, "superpositions') like those in the many-worlds interpretation or in 
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orthodox quantum mechanics. Thus, the definition of "elements of reality" assumed not 
just scientific realism, but actually a single-valued realism that guarantees definiteness as 

The many-worlds interpretation of quantum mechanics satisfies the Bell test in a pro- 
foundly different manner. It allows the laws of physics to remain local, at the cost of 
violating counterfactual definiteness. One may not like the many-worlds interpretation for 
several reasons (and this author might agree), but it does provide an example in which 
locality can survive. The many-worlds interpretation is thus realist (in the sense that super- 
positions can be regarded as real entities and that every possible measurement result exists 
in some branch), deterministic (the superpositions evolve according to a deterministic wave 
equation), and local (involving only point-like interactions), but counterfactually indefinite. 
In this case the multiple possibilities described by the superposition preclude a single def- 
inite possibility, and thus provide the means for violating counterfactual definiteness. The 
many-worlds interpretation is not only counterfactually indefinite, it is factually indefinite 
as well. Even when measurements are actually performed, many different results can exist 
in a multitude of different branches. A single definite result is not guaranteed. 

The orthodox interpretation violates Bell's inequality by violating both counterfactual 
definiteness (via superposition and complementarity) and locality (via collapse). It there- 
fore encompasses two methods for getting around Bell's constraints. Incidentally, this in- 
terpretation is also non-deterministic and non-realist (the state of a physical system is not 
necessarily defined until it is collapsed by measurement), but these features are not critical 
in Bell's analysis. 

Since Bell's original work, a number of other "common sense" constraints on physical 
theory have been derived,— based on similar thought experiments. However, these deriva- 
tions rest on assumptions at least as strong as what is in Bell's argument, and their violation 
doesn't seem to constrain physical reality any further. 42 Some researchers are now focused on 
discovering new constraints based on assumptions that are distinct from Bell's assumptions. 
Leggett^ has made recent progress in this area by studying a class of nonlocal hidden- 
variable theories, called "crypto-nonlocal" theories, which make some experimental predic- 
tions that differ from those of quantum mechanics even though they agree on the Bell test. 
Groblacher et al.— have implemented a version of Leggett's theorem in a real experiment 
and have ruled out this class of nonlocal hidden-variable theories. In a different approach, 
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Tegmark has written about an experiment he calls "quantum suicide," which might be used 
to distinguish between the many-worlds interpretation and other interpretations in which 
measurements produce single outcomes.— 

It appears we have not yet reached the end of research in this field. Researchers are still 
developing testable theorems about the nature of reality, allowing us to introduce science 
into what was once the realm of pure philosophy. On this note, we can look forward to a 
new generation of debate about physical reality that follows in the footsteps of John Bell. 
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example only to discuss locality, and does not offer an analysis of any other assumptions in the 
logic. 

Our example of Bell's inequality derives from comparison of measurements at 0° and ±30°. A 
more general result can be derived for correlations between any arbitrary three angles. Take 
C (61,62) to represent the comparison between measurements of the two photons at angles 6\ 
and 62, with C = — 1 identifying a mismatch (one transmission and one absorption) and C = 1 
identifying a coincidence (both absorbed or both transmitted). We can verify that in general 
1 + C(6 1 ,6 3 )C(6 1 ,6 2 ) + C(6 2 ,6 3 ). If C(6 1 ,6 2 ) = C(6 2 ,6 3 ) = 1 (measurements at the three 
angles are either all absorbed or all transmitted), then C (61,63) = 1 as well and both sides 
of the inequality equal 2. If either 0(61,62) = —1 or 0(62,63) = — 1, then the left side of the 
inequality is at most zero and the right side of the inequality is at least zero. By taking averages 
over many measurements we have 1 + <C (61, 6 3 )><C (6\, &2)> + <C (62, 6 3 )>. In the special case 
where <C(6i,6 2 )> = <C(6 2 ,6 3 )> = 0.5 (that is, 25% mismatch), we get <C(6 1 , <9 3 )>?0 (that 
is, greater than 50% coincidence), which agrees with our example. There is a sign difference 
between this result and Bell's original inequality (see Ref. \£) only because his result was derived 
for singlet state fermions, while our result applies to twin state photons. 

The case a = 30° gives the maximum difference between quantum mechanics and the inequality 
limit from Bell's theorem. 

In fact, there are locality assumptions throughout the argument. Initially, we considered mea- 
surements with 6\ = 62 = to verify perfect coincidence between the two photon measurements. 
Next we rotated filter 2 to angle a to verify (assuming we did not affect the sequence at filter 1) 
that the 62 = a sequence coincides with the 6\ = sequence at the 75% level. Alternatively, 
we rotated filter 1 to 6\ = —a to verify (assuming we did not affect sequence at filter 2) that 
the 6\ = —a sequence coincides with the 62 = sequence at the 75% level. Every step in our 
example involved an implicit locality assumption. 

The case against superluminal signaling also rests on the assumption that Bob's entangled pho- 
ton cannot be copied. This feature of quantum mechanics, known as the "no-cloning theorem," 
was proven in W. K. Wootters and W. H. Zurek, "A single quantum cannot be cloned," Nature 
299, 802-803 (1982). 

Bell emphasized that determinism was not a critical assumption when he published a more 
general proof of his inequality based on assumptions of local distributions in hidden- variables. 
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